RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial 
Source: 

Date Processed by 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: natin21heln@usnto.2Qv or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3help(g),uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a statc-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1 821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST 25 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result in fewer errorcd sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



# 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



SERIAL NUMBER: 0 C ] j^03 // 



1 Wrapped Nuclcics 

Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. *this includes white spaces. 



10 



12 



Misaligned Amino 
Numbering 

Non-ASCII 



Variable Length 



Patentln 2.0 
"bug" 



Skipped Sequences 
(OLD RULES) 



_Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



Invalid <213> 
Response 

Use of<220> 



Patentln 2.0 
"bug" 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved in ASCII text. 

Sequencers) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused Afe <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy th« relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION :SEQ ID NOX: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Scquence(s) 



missing. Ifintentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species) <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Sequence 



Sequences) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/0 9/90 3 , 180 TIME: 15:13:29 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903180.raw 
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CoTecV: i 



SEQUENCE LISTING t /OPS N r ( 

5 (1) GENERAL INFORMATION: 

APPLICANT: De Robertis, Edward M. 

Bouwmeester, Tewis 
TITLE OF INVENTION: Endoderm, Cardiac and Neural Inducing 

Factors 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Majestic, Parsons, Siebert & Hsue 

(B) STREET: Four Embarcadero Center, Suite 1100 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 94111-4106 
COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS/MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/903,180 

(B) FILING DATE: ll-Jul-2001 

(C) CLASSIFICATION: 
PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/020,150 

(B) FILING DATE: 20-JUN-1996 
ATTORNEY/AGENT INFORMATION: 

(A) NAME: Siebert, J. Suzanne 

(B) REGISTRATION NUMBER: 28,758 

(C) REFERENCE/DOCKET NUMBER: 3100.002US1 
TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/248-5500 

(B) TELEFAX: 415/362-5418 



[ARACTJE RISTICS ; , \ H rfl 

I: ^97 9 /amino acids ' I 




ERRORED SEQUENCES 

330 (2) INFORMATION FOR SEQ ID NO: 5: 

332 (i) SEQUENCE CHARA£ 

333 (A) LENGTH: 

334 (B) TYPE: amiTTt) acid 

335 ( D ) TOPOLOGY: linear 
337 (ii) MOLECULE TYPE: peptide 

339 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

341 Met Leu Leu Leu Phe Arg Ala lie Pro Met Leu Leu Leu Gly Leu Met 

342 1 5 10 15 

344 Val Leu Gin Thr Asp Cys Glu lie Ala Gin Tyr Tyr lie Asp Glu Glu 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/90 3 , 180 TIME: 15:13:30 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903180.raw 
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IRS 

1 O -J 










1 Q0 

17U 






ft "» 7 
o ^ / 


AS ri 


Asn 


i yx 


Asn 


i y i 


V a X 


T 1 p> 
X Itr 


Arg 


Ala 


ii y o 


Va 1 


Lys 


Glu 


He 


Lys 


Thr 


ft » ft 

O ^ O 






1 Q S 










9 0 0 










9 0S 

Z U J 








ft 7 n 


Lys 


is 


ri X b> 


Asp 


Va 1 


Thr 
1 HX 


Hid 


VQl 


V a. 1 


Pin 

ulU 


V Cl 1 


Lys 


Glu 


He 


Leu 


Ly s 


ft VI 




9 10 
ZiU 










Z1J 










9 9 0 
z z u 










ft 3 } 

O O J 


□ CI 




]_j€U 


wax 


Asn 


lit 


r X LJ 


Arg 


Asp 


Thr 


Val 


Asn 


Leu 


T v r 
i y x 


Thr 


Ser 


8 3 4 


2 2 5 










o n r\ 
2 3 U 










1 T C 
Z J J 










2 4 U 


836 


Ser 


Gly 


Cys 


Leu 


Cys 


Pro 


Pro 


Leu 


Asn 


Val 


Asn 


Glu 


Glu 


Tyr 


He 


He 


837 










245 










250 










255 




839 


Met 


Gly 


Tyr 


Glu 


Asp 


Glu 


Glu 


Arg 


Ser 


Arg 


Leu 


Leu 


Leu 


Val 


Glu 


Gly 


84 0 








260 










265 










270 






842 


Ser 


He 


Ala 


Glu 


Lys 


Trp 


Lys 


Asp 


Arg 


Leu 


Gly 


Lys 


Lys 


Val 


Lys 


Arg 


84 3 






275 










280 










285 








845 


Trp 


Asp 


Met 


Lys 


Leu 


Arg 


His 


Leu 


Gly 


Leu 


Ser 


Lys 


Ser 


Asp 


Ser 


Ser 


846 




290 










295 










300 










848 


Asn 


Ser 


Asp 


Ser 


Thr 


Gin 


Ser 


Gin 


Lys 


Ser 


Gly 


Arg 


Asn 


Ser 


Asn 


Pro 


84 9 


305 










310 










315 










320 


851 


Arg 


Gin 


Ala 


Arg 


Asn 
























852 










325 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/90 3,18 0 



TIME 



DATE 



07/26/2001 
15:13: 31 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\07262001\I903180.raw 



L:30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 

L:31 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:465 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:5 
M:332 Repeated in SeqNo=5 

L:473 M:203 E: No. of Seq . differs, LENGTH : Input : 9 7 9 Found:707 SEQ:5 

L:792 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:9 
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